Regression with Multiple Candidate Models: Selecting or Mixing?

نویسنده

  • Yuhong Yang
چکیده

Model averaging provides an alternative to model selection. An algorithm ARM rooted in information theory is proposed to combine di erent regression models/methods. A simulation is conducted in the context of linear regression to compare its performance with familiar model selection criteria AIC and BIC, and also with some Bayesian model averaging (BMA) methods. The simulation suggests the following. Selection can yield a smaller risk when the random error is weak relative to the signal. However, when the random noise level gets higher, ARM produces a better or even much better estimator. That is, mixing is advantageous when there is a certain degree of uncertainty in choosing the right model. In addition, it is demonstrated that when AIC and BIC are combined, the mixed estimator automatically behaves like the better one. A comparison with bagging (Breiman (1996)) suggests that ARM does better than simply stablizing model selection estimators. In our simulation, ARM also performs better than BMA techniques based on BIC approximation. ARM is a computationally feasible way to combine models and/or non-model-based procedures. It is a convex combination of the original estimators with data-dependent weights. For the determination of the weights, the data is split into two parts. The rst one is used for estimation by each model or procedure and the accuracies of these estimators are assessed using the second half of the data. The accuracies are then used to assign the weights in a way such that a connection between function estimation and information theory ensures a desired theoretical capability of adaptation over di erent models and/or regression procedures.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Overview of the New Feature Selection Methods in Finite Mixture of Regression Models

Variable (feature) selection has attracted much attention in contemporary statistical learning and recent scientific research. This is mainly due to the rapid advancement in modern technology that allows scientists to collect data of unprecedented size and complexity. One type of statistical problem in such applications is concerned with modeling an output variable as a function of a sma...

متن کامل

Uncertainty Estimation in Stream Bed Sediment Fingerprinting Based on Mixing Model

Uncertainty associated with mixing models is often substantial, but has not yet been fully incorporated in models. The objective of this study is to develop and apply a Bayesian-mixing model that estimates probability distributions of source contributions to a mixture associated with multiple sources for assessing the uncertainty estimation in sediment fingerprinting in Zidasht catchment, Iran....

متن کامل

An Overview of Nonlinear Spectral Unmixing Methods in the Processing of Hyperspectral Data

The hyperspectral imagery provides images in hundreds of spectral bands within different wavelength regions. This technology has increasingly applied in different fields of earth sciences, such as minerals exploration, environmental monitoring, agriculture, urban science, and planetary remote sensing. However, despite the ability of these data to detect surface features, the measured spectrum i...

متن کامل

Flood Hydrograph Analysis Through Employing Physical Attributes Using Two and Multiple Variables Regression Factor and Cluster Analysis

Since direct experimental evidence is not available, this must be verified through a modeling approach, providedadequate data be available. Many statistical methods are used to study the relation between independent anddependent variables.This research was carried out at the western part of Jazmurian basin tlocated in the southeast ofIran. In this paperused ten physical characteristics such as ...

متن کامل

Mathematical Modeling and Analysis of Spark Erosion Machining Parameters of Hastelloy C-276 Using Multiple Regression Analysis (RESEARCH NOTE)

Electrical discharge machining has the capability of machining complicated shapes in electrically conductive materials independent of hardness of the work materials. This present article details the development of multiple regression models for envisaging the material removal rate and roughness of machined surface in electrical discharge machining of Hastelloy C276. The experimental runs are de...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999